Multi-space random mapping for speaker identification
نویسندگان
چکیده
This paper presents a work of utilizing multi-space random mapping (MRM) to formulate a dual-factor identification system, which combines speaker biometric and personal token. Personal token will be assigned to the client to constitute a unique random subspace during enrollment and the speaker template will be generated within the random subspace. Test features will be mapped to the random subspace that is described by the personal token presented during identification. Our work has shown that MRM-system exhibits stronger discriminative ability when comparing test features to its counterfeit templates, which lied in other different random subspaces. This advantage thus contributes to better F-ratio and greater accuracy recognition. Experiments on YOHO corpus demonstrate a remarkable result where the system achieves the perfect identification rate.
منابع مشابه
Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution
In our previous work, we have proposed a speaker modeling technique using spectral and pitch features for text-independent speaker identification based on Multi-Space Probability Distribution Gaussian Mixture Models (MSD-GMMs). We have presented a maximum likelihood (ML) estimation procedure for the MSD-GMM parameters and demonstrated its high recognition performance. In this paper, we describe...
متن کاملSpeaker identification with dual penalized logistic regression machine
This paper proposes a novel speaker identification method based on the dual Penalized Logistic Regression Machine (dPLRM) for general multi-class discrimination. The machine employs kernel functions which implicitly map an acoustic feature space to a higher dimensional space. Each speaker is discriminatively identified in this space implicitly. The penalized logistic regression model used in dP...
متن کاملSpeaker identification using Gaussian mixture models based on multi-space probability distribution
This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). The MSD-GMM allows us to model continuous pitch values for voiced frames and discrete symbols representing unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled...
متن کاملStructured prediction for speaker identification in TV series
Though radio and TV broadcast are highly structured documents, state-of-the-art speaker identification algorithms do not take advantage of this information to improve prediction performance: speech turns are usually identified independently from each other, using unstructured multi-class classification approaches. In this work, we propose to address speaker identification as a sequence labeling...
متن کاملCluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification
Speakers are generally identified by using features derived from the Fourier transform magnitude. The Modified group delay feature(MODGDF) derived from the Fourier transform phase has been used effectively for speaker recognition in our previous efforts.Although the efficacy of the MODGDF as an alternative to the MFCC is yet to be established, it has been shown in our earlier work that composit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEICE Electronic Express
دوره 2 شماره
صفحات -
تاریخ انتشار 2005